Pattern-based automatic taxonomy learning from the Web
نویسندگان
چکیده
The construction of taxonomies is considered as the first step for structuring domain knowledge. Many methodologies have been developed in the past for building taxonomies from classical information repositories such as dictionaries, databases or domain text. However, in the last years, scientists have started to consider the Web as valuable repository of knowledge. In this paper we present a novel approach, especially adapted to the Web environment, for composing taxonomies in an automatic and unsupervised way. It uses a combination of different types of linguistic patterns for hyponymy extraction and carefully designed statistical measures to infer information relevance. The learning performance of the different linguistic patterns and statistical scores considered is carefully studied and evaluated in order to design a method that maximizes the quality of the results. Our proposal is also evaluated for several well distinguished domains, offering, in all cases, reliable taxonomies considering precision and recall.
منابع مشابه
PATTY: A Taxonomy of Relational Patterns with Semantic Types
This paper presents PATTY: a large resource for textual patterns that denote binary relations between entities. The patterns are semantically typed and organized into a subsumption taxonomy. The PATTY system is based on efficient algorithms for frequent itemset mining and can process Web-scale corpora. It harnesses the rich type system and entity population of large knowledge bases. The PATTY t...
متن کاملPedagogical Principles of the Theories of Interaction in Distance Learning: The Study of Interaction Anderson Model in Web-based Environments
Background: Interaction is considered a necessary, and integral part in all forms of distance learning and web -based learning environments. That how much technologies can support and help interactions is one of the most important issues that should be considered in the web-based environment. These interactions seek to engage students with other students, teachers and also non-human contex...
متن کاملA Graph-Based Algorithm for Inducing Lexical Taxonomies from Scratch
In this paper we present a graph-based approach aimed at learning a lexical taxonomy automatically starting from a domain corpus and the Web. Unlike many taxonomy learning approaches in the literature, our novel algorithm learns both concepts and relations entirely from scratch via the automated extraction of terms, definitions and hypernyms. This results in a very dense, cyclic and possibly di...
متن کاملCombining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کاملGeoreferencing Semi-Structured Place-Based Web Resources Using Machine Learning
In recent years, the shared content on the web has had significant growth. A great part of these information are publicly available in the form of semi-strunctured data. Moreover, a significant amount of these information are related to place. Such types of information refer to a location on the earth, however, they do not contain any explicit coordinates. In this research, we tried to georefer...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- AI Commun.
دوره 21 شماره
صفحات -
تاریخ انتشار 2008